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Abstract 

Much recent research activity has been devoted to empirical study and 
theoretical models of complex networks (random graphs) possessing three 
qualitative features: power-law degree distributions, local clustering, and 
slowly-growing diameter. We point out a new (in this context) platform 
for such models - the stochastic mean-field model of distances - and within 
this platform study a simple two-parameter proportional attachment (or 
copying) model. The model is mathematically natural, permits a wide 
variety of explicit calculations, has the desired three qualitative features, 
and fits the complete range of degree scaling exponents and clustering 
parameters; in these respects it compares favorably with existing models. 

1 Introduction 

The topic of complex networks, more precisely the design and theoretical analysis 
of stochastic models of large graphs which differ from the classical Erdos - Renyi 
model, has attracted intense recent attention, surveyed from a statistical physics 
viewpoint in ^ 111)1 115| and from a rigorous mathematical viewpoint in [7] . 

Let us frame one aspect of this topic, by analogy. In freshman statistics we 
learn that bivariate data (e.g. heights and weights of n individuals) can be sum- 
marized by 5 summary statistics: average height, standard deviation of height, 
average weight, standard deviation of weight, correlation coefficient. And there 
is a 5-paramctcr probability model, the bivariate Normal, which (in several 
precise senses) exactly corresponds to these particular summary statistics. In 
the context of real-world graphs (where we will always regard the number n of 
vertices as large), one could analogously seek a crude statistical description by 
reporting a set of summary statistics. An evident choice is 

• d — average vertex-degree 
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and recently popular extra choices include 

• an exponent 7 characterizing power-law tail behavior of degree distribution 

• a "clustering coefficient" k measuring relative density of triangles 

• the average distance I between vertex-pairs. 

These choices reflect and seek to quantify three qualitative features claimed to 
hold in many interesting graphs (from WWW links to human social networks): 
power-law degree distribution, local clustering of edges, and diameter growing 
as O(logn). So from the viewpoint of classical mathematical statistics, it would 
be natural to seek a several-parameter stochastic model of random graphs whose 
parameters could be readily identified with summary statistics of the kind above. 
In more detail, we propose three desiderata 1 for a satisfactory model, beyond 
possessing the three qualitative features mentioned above: 

• mathematical tractability: one can find reasonably explicit formulas for a 
variety of quantities of interest 

• fitting flexibility: by varying model parameters one can vary summary 
statistics (like the 4 listed above) broadly through their possible ranges 

• naturalness: the qualitative properties emerge from some simple underly- 
ing mathematical structure rather than being forced by fiat. 

Unfortunately no satisfactory such models are known. The statistical physics 
literature surveyed in ^ HUj starts with a few elementary model-construction 
ideas (such as the proportional attachment and small worlds models mentioned 
in section|BJ) and then explores numerous variations. Our purpose in this paper 
is to introduce a new class of model we call metric copying, 2 and to study 
a particular two-parameter model (mean-field simple copying, MFSC) within 
this class. The description and analysis of the MFSC model involve somewhat 
more sophisticated mathematical visualization than has been used in previous 
complex networks literature. So let us first address the first two desiderata by 
listing results for the model (section and only later (section describe the 
model. Section0]derives most of the formulas in section|3 and section[S]cxhibits 
further calculations. A briefer account of the model, aimed at mathematicians, 
appears in 0|. 

1.1 Some notation 

P(-) denotes probability, E(-) denotes expectation, and var (■) denotes vari- 
ance. We assume familiarity with elementary probability notions of random 
variables and their distributions. We write Geo(p), Bin(m,p), Exp(/i), Poi(?7) 

1 From an applied viewpoint, one could regard "fitting empirical data" as the single crite- 
rion; we are of course taking a theoretical viewpoint 

2 We use mathematical terminology: a metric is a distance function. Confusingly, some 
engineers use "metric" to mean "summary statistic" 
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for the geometric, binomial, exponential and Poisson distributions in their usual 
parametrizations, reviewed below. We employ a "blackboard shorthand" of also 
writing Geo(p) etc for a random variable with that distribution. Thus the el- 
ementary reproductive property of the binomial distribution could be written 

as 

Bin (mi + m2, p) = Bin (mi, p) + Bin(m2,p) 

where the random variables on the right are independent, and where = means 
equality in distribution. The point of this notation is that, analogous to "compo- 
sition of functions" in which we interpret exp((x — l) 2 ) as the composition of the 
two functions exp(x) and (x — l) 2 , we can "compose" (statisticians say "mix") 
distributions. For instance (cf. (JSJ) below), given a random variable A with 
values in (0,1) we can write Geo (A) for a random variable whose conditional 
distribution given A = p is the Geo(p) distribution. 
Review of elementary distributions. 

P(Gco(p)=i) = (l-py-'p, 7 = 1,2,... 
PGeo(p) = p- 1 

P(Bm(m,p) = i) = Qp'a-pr.^O,!,...^ 

PBin(m,p) = mp (1) 
var Bin(m,p) = mp(l — p) (2) 
P(Poi(7,) = ?,) = e^rf/il, i= 0,1,2,... 
EVo\(vi) = rj. 

The Exp(/j) distribution has probability density function and expectation 

f(x) = /Je _M2: , < x < oo 
EExp(n) = fT 1 . 

A Poisson process of rate 1, say (0 < £i < £2 < £3 < • ■ ■), is defined by the 
property 

£1, £2 — £i> £3 — • • ■ are independent with Exp(l) distribution 
and has the property 

P(some £j G [x, x + dx\) = 1 • dx, < x < 00. (3) 



1.2 Organization of paper 

Because the precise definition and a priori motivation of the model are lengthy 
to explain, we start by emphasising the a posteriori motivation, the fact that 
the model permits many explicit calculations. In first reading the formulas in 
section|21 focus on the left sides of equations, indicating what quantities can be 
calculated. The formulas on the right sides will be derived in section 0] 
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2 Formulas 



2.1 Key methodology 

Like other models involving vertices arriving and creating edges to existing 
vertices, the MFSC model defines a directed acyclic (no directed cycles) random 
graph Q n on n vertices. A key feature of the model is that there exists a 
well-defined limit infinite rooted graph Q*^ which represents the n — > oo limit 
of Q n rooted at a uniform random (we say "typical") vertex. So for "local" 
statistics of Q n , one can give "exact formulas in the n — > oo limit" by doing 
calculations within the limit structure Q*^ , and this methodology is how we will 
derive (section 0} and interpret the formulas in sections 12 . 21 - 12 . til below . 

Note that "rooting" is introduced merely as a convenient technical way to 
deal with infinite graphs. By analogy, one could study two-dimensional space 
without introducing the origin point (cf. Euclidean geometry) but for many 
purposes an origin and induced coordinate system are helpful. 

The MFSC model has two parameters: a, A. In explicit formulas, we distin- 
guish between a low clustering region defined by parameter ranges 

< a < 1, 0<A<l/a [low] (4) 

and the complementary high clustering region defined by a A > 1; in the latter 
case it is convenient to reparametrize by using r\ :— A -1 log(aA) in place of a, 
and the parameter ranges are 

0<?/<l, 77 + 1/A < 1. [high] (5) 

This distinction is purely notational; there is no intrinsic "non-analyticity" in 
the model's properties. 



2.2 The two parameters control mean degree and cluster- 
ing 

(a) . First consider Z3j n and -D utj the random in-degree and out-degree of a 
typical vertex. Then 

f [low] 
ED m = ED out (= 8, say) = j r,+i/A^ j Mg ^ (6) 

(b) . Second, define a normalized clustering coefficient Cluster in words as 

The proportion of directed 2-paths vi — > v 2 — > 1*3 for which vi — > w 3 
is also an edge. 



(see H44fl for a more precise definition and derivation of (JTJ). Then 



By solving H(jl7fl we find (section 14. tj|) that every pair of values of d, K c iustcr in 
the complete range 

< d < 00, < Cluster < 1 

occurs for a unique parameter pair (a, A) or (?y, A). Moreover the two regions 
can be specified as 

< d < oo, < K clust0 r < t low ] 

< 9 < OO, ^ < ^cluster < 1 [high] 

explaining our low and high clustering terminology. So the two model parameters 
a, A have fairly direct interpretations in terms of mean degree and clustering; 
of course we could re-parametrize the model in terms of d and Cluster, but the 
internal mathematical structure is more conveniently expressed using the given 
parameters. 

2.3 Distributions of in- and out-degrees 

(a) . The distribution of D m is specified as 

1 + D in = Gco(e~ /3T ) where T = Exp(l) (8) 

and where 

rj + l/X [high]. 



0=1 a [l0w] (9) 



This works out explicitly as 

v m ' /3 2 r(d + 2 + i) - v ' 

with asymptotics 

p(D in = d) ~ r 2 r(i//?) <r 1- ?. 

Formula (|ll|l appears as a special case of recent results in two-parameter pro- 
portional attachment models 111! IT5| , but in fact is a famous 80-year old 
calculation - see section |4~21 

(b). The distribution of D ou ^ is determined by the distributional equation 

D d j ££iBin(l+A,aAe-*<) [l ow ] 

1 EE^l + £0 + E~iBin(l + A,e-^) [high] 

where D, Di,D[, i > 1 are i.i.d. random variables distributed as -D ou t and 
where < £i < £2 < ■ ■ ■ are the points of a rate-1 Poisson point process on 
(0,oo). 
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We do not know how to extract a useful explicit formula from (|12[1 but we 
can compute moments. For instance 



var D out 



a(l-a+g 2 A/2) 
(l-a) 2 (l-±a 2 A) 
(r,+ J x )(2-r,-l) 
(l-»?-^)(l-r ) -i) 2 



[low] 
[high] 



(13) 



In the case A = 1/q and in the limits A — > 0, A — > oo we get explicit formulas 
for the distribution of -D ou t - see section 14.31 - which show in particular that 
the tail of -D ut has geometric rather than power-law decay. 
(<•)• 

D- m and -D ut are independent. (14) 

Because both D ov ^ and D- m can take the value 0, we see that P(D^ n +D ou i = 
0) > 0, implying that Q n will typically not be connected (see section l2~7l for 
further comments). 



2.4 Densities of induced subgraphs 

One of the major advantages of the model is that, for a fixed "small" graph G, 
one can (in principle, and often in practice) calculate explicitly an "asymptotic 
density" denSoo(G) interpreted as the limit 

number of copies of G in Q n 
lim = denSoo(Cr). 

n — >oo Ti 

Precise definitions are fussy, and are deferred to section l4~4l which also records 
the explicit formulas we have found. Here let us point out the formula for 
triangles K 3 : 



„3 



A 



den Soo (Jf3) = I f + a l\f+$ . ' (15) 

The formula above is the key ingredient in the formula for Cluster- Recall the 
verbal description of K c i us t e r: 

The proportion of directed 2-paths v± — > «2 — > Va for which v% — > V3 
is also an edge. 

It is intuitively clear (and formalized at (|40|) that the asymptotic density for 
occurrence of directed 2-paths v\ — > vi — > V3, if one does not look whether 
or not a third edge v\ — > V2 is present, equals d 2 (because of independence of 
in-degree and out-degree at i^)- So the verbal definition translates to 

_ dens^i^) 

^cluster 



[low] 



Q2 

and then ljl5|l immediately gives formula 10 for K c iustcr- 
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2.5 Triangle density as a function of degree 



The parameter K c i us ter gives an overall measure of triangle density. A more 
detailed description is provided by statistics C(k), k > 2 defined by 

. EYnumber of triangles containing a random degree-k vertex) 

c(k) = (5 ■ 

In principle the methods of this paper could be used to obtain an exact formula 
for C(k), but we shall be content with outlining (section 153)1 the tail property 

~ r^ 2 r x i as fc — > oo. (16) 
See section for further comments. 



2.6 Edge-lengths 

Our model has a "metric structure", in that there is a distance c? motric (u, w) 
between any two vertices which does not involve the realization of edges in 
the random graph. So each edge (v,w) of the graph has a real-valued length 
rfmetric(^ , w) , and so a typical edge has a random length L. The probability 
density function for L is given by the formula 

... l-a^ (» + l)r(a + 3) (-Ax)* 

f(x) = ' ) ; \ '-, 0<x<oo [low]. (17) 

a •f— 1 1 (i + a + 6) 

2 — 7 

Matriematica gives an equivalent expression as a sum of incomplete hypergeo- 
metric functions. One can readily observe that f(x) = exp(— (A ± o(l))x) as 
x —> oo. In the underlying metric space, the number of vertices within distance 
a; of a typical vertex grows as e x . So the tail behavior of f(x) suggests 

the chance that a vertex has an edge to its k'th nearest neighbor 
should scale as /c~ A_1 

though we have not attempted detailed calculations to verify this suggestion. 
Note this property appears without being explicitly built into the model. 



2.7 Other local statistics 

There are further questions, concerning exact behavior in the n — > oo limit, 
which are in principle solvable in terms of the limit network Q*^, but where we 
have been unable to obtain usefully explicit answers. A major question concerns 
the percolation probability 

_Pperc(a, A) = P(typical vertex is in infinite connected component of G^) 

(18) 
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By analogy with classical facts about the Erdos - Renyi model, we expect that 
above the percolation threshold, that is when Pperc(a, A) > 0, the random graph 
Q n will have a giant component whose size C„(a, A) satisfies 

n~ l EC n (a, A) -> p pe rc(a, A). 

Unfortunately we do not see how to write pperc(a, A) as a solution of any simple 
equation. By studying an easier-to-analyze directed percolation problem, it is 
not hard to show (section l5.3fl 

if 2/3 - f3 2 > 1 then p pe rc(a, A) > 0. (19) 

2.8 Average distance 

In any graph, write G? graph (i>, w) for the minimal number of edges in any path 
from v to w. The diameter A and the average vertex-vertex distance A are 
defined by 

A = max d siaph (v , w), A = a,ve v , w d sraph (v, w). 
In the context of a simple proportional attachment model it is known [H] that 

(1 + o(l))logn 
EA n , EA n = v T , as n -> oo. 

log log n 

It is natural to conjecture, but hard to prove, the same result for our model 
(above the percolation threshold and restricted to the giant component). On 
the other hand it seems likely that standard techniques of abstract mathematical 
probability would be enough to show the weaker bound EA n — 0(log n) as n — > 
oo. Such questions cannot in principle be answered completely using Q*^. 

2.9 Summary of advantages and disadvantages of the model 

The previous sections convey some advantages of the model: 

• it has the three qualitative features desired in a complex network model 
(power-law degree distribution, clustering, small diameter); 

• it fits the complete possible range of mean degree (or scaling exponent) 
and clustering parameters; 

• it permits a broad range of explicit calculations. 

So to be fair let us list some disadvantages from a modeling viewpoint. 

• Q n is not connected (for large n); cf. section |2~71 

• there is no power law for distribution of out-degree; 

• in-degree and out-degree are independent; 
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• the scaling exponent for in-degrec is determined by the mean degree; one 
might prefer a model where these could be specified separately; 

• in the n — > oo limit not every finite graph is possible as an induced sub- 
graph (section |OJ. 

3 The model 

3.1 Metric copying models 

Let us briefly outline a general modeling framework, metric copying models. 
Each vertex v is a point in a metric space; that is, there is some real-valued 
distance d(v, w) between any two vertices v, w. Given some rule for the positions 
of successive vertices 1,2,.. ., and given a function p : [0, oo) — * [0, 1], we can 
construct random directed graphs Q n inductively on n as follows. When vertex 
n arrives, then 

(i) for each directed edge (i,j) of G n -l> a "copied" edge (n,j) is created with 
probability p(d(n, i)); 

(ii) for each vertex i (1 < i < n), a new edge (n,i) is created with probability 
p(d(n,i)); 

(iii) the events above are independent, except that repeat edges are censored. 

Imagine p(-) to be rapidly decreasing. A moment's thought shows how this 
model resembles proportional attachment models. An existing vertex v with in- 
degree d has d + 1 opportunities to acquire an in-edge, due to the next arriving 
vertex being close to v or close to one of the d vertices with edges to v. 

In principle one could study such models based on random points in d- 
dimensional space, but within such settings it is notoriously hard to do explicit 
calculations (see e.g. JB] for different models of random graphs based on d- 
dimensional random points), and the choice of d is arbitrary. We will avoid 
both problems by using a well known (in other contexts) model which is loosely 
interpretable as "random points in infinite-dimensional space". Note that in 
d-dimensional space, the number of points within distance r of a typical point 
grows as r d \ what will make our model "infinite-dimensional" is that this number 
grows as e r . 

For later use recall that a pre-metric d(i,j) is symmetric and strictly positive 
for j ^ i. A pre-metric can be used to specify a metric d(v , w) as the minimum, 
over paths v = io,i%, . . .,ik = w, of d(i , h) + d(h,i 2 ) + ■ • • + dtyk-uh)- 

3.2 A (i-dimensional analogy 

As a final preliminary, the following analogy may be helpful. In d-dimensional 
space R d , take a cube [— n 1 / d /2, n 1 ^ d /2] d of volume n, and put n uniform random 
points in that cube. This structure has a n — > oo limit, the Poisson point 
process in R d with mean intensity 1 point per unit volume. Moreover the limit 
process, which is a spatial point process on all of R d , can be represented as the 
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distribution, at any fixed time, of a time-evolving process of points on all of R , 
where the evolution rules are 

(i) points move away from the origin as deterministic motion with exponential 
rate 1/d; a point at position x at time t will be at position xe^ ~^/ d at times 
t' > t. 

(ii) New points arrive throughout R d as a rate-1 space-time Poisson process; 
that is, the chance of a point arriving in a cube of volume dx during a time 
interval dt equals 1 • dx dt. 

Thus if one takes a volume- 1 region of space at time to, this space expands to 
become volume e* at time to + t, and the arrival rate per unit time within this 
expanding volume is e* at time to + t. 

In the limit process, one may regard the "present time" as time 0, and regard 
the process as having evolved 3 over time — oo < t < 0. Particles at the present 
time have ages which are independent Exp(l) random variables independent of 
present positions; from the present configuration of positions and ages one can 
deterministically reconstruct the past evolution of the process. 

3.3 The stochastic mean-field model of distance 

Our model of an underlying metric space is specified by three rules. 

(i) Point n arrives at time t n = log ri. 

(ii) At the arrival time i„, define the pre-distances (D(n, j; t n ), 1 < j < n — 1) 
from n to the earlier-arriving points to be independent random variables with 
exponential, mean n, distribution. 

(iii) Distances grow exponentially with time; D(n,j;t) = e t_t ™ D(n, j;t n ) for 
t > t n . 

So at time t there are n = [e*J points, and the pre-distances D(i,j;t), 1 < 
i < j < n) are independent random variables with exponential, mean e*, distri- 
bution. These particular pre-distances are an instance of a pre-metric, and this 
pre-metric specifies a metric D{i,j;t), 1 < i < j < n). Write (T> t ,0 <t< oo) 
for this process of arriving points and distances. 

Here is a key feature of this construction. At time t pick a uniform random 
point V t as a "root" . Then there is a t — > oo limit (in distribution) structure, 
which is a metric space on a countable infinite number of points, one being 
distinguished as the root. The limit structure, called the PWIT, is described 
below. The meaning of "limit" is that, for arbitrary fixed r < oo, the config- 
uration of points in T> t within distance r of Vt converges in distribution to the 
configuration of points of the PWIT within distance r of the root (this is local 
weak convergence of random networks j^l). 

3 This model is reminiscent of the steady-state theory of the Universe advocated by Fred 
Hoyle in the 1950s 
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3.4 The PWIT 



The PWIT is defined by a construction, illustrated in Figure l 4 . Start with 
a single root vertex 0. This root vertex is then given an infinite number of 
near neighbors, and the edges from the root to the near neighbors arc assigned 
lengths according to a realization of a Poisson process : 1 < i < oo) of rate 
1 on (0, oo). Now, recursively, each vertex v arising as a near neighbor of a 
previous vertex is given an infinite number of near neighbors, and the edges to 
these near neighbors of v are again assigned lengths according to an independent 
realization of a Poisson process (£" : 1 < i < oo) of rate 1. This procedure is 
then continued ad infinitum. The resulting rooted infinite tree is a well defined 
random object, called the Poisson weighted infinite tree (PWIT). 




Figure 1. The PWIT. Illustration of the vertices of the PWIT within a window 
of radius 3 centered on the root 0. Lines indicate the near neighbor relationship, but 
are drawn only when both end-vertices are within the window. Thus the four near 
neighbors of shown are at distances < £f < £f < £3 < £4 < 3 from 0, while there 
are an infinite number of near neighbors of at distances greater than 3. Orientation 
of lines in pictures is arbitrary. Labels a, b, c are included for later comparisons. 

4 Our figures arc illustrations of the definitions, rather than honest Monte Carlo simulations 
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The distance D(v,w) between two vertices of the PWIT is just the sum of edge- 
lengths along the path from v to w. Though we have drawn a tree in Figure 1, 
the lines merely indicate the near neighbor relationships; it is better to think 
of the edges as absent while retaining the distances D(v,w). In this way we 
may regard the vertices of the PWIT as an infinite-dimensional analog of the 
d-dimensional Poisson point process in section EOl Formula (|29|) later provides 
one formalization of "infinite-dimensional" . 

The survey |Hj gives a careful explanation of how the PWIT arises as a 
limit of finite models such as T> t , and gives some applications to combinatorial 
optimization 5 . The key point is that, for an arriving vertex V t un.T>t, the existing 
vertices at smallest l)-distances correspond in the limit to the near neighbors 
in the PWIT. (Recall the section EP1 construction: we are repeating the "key 
feature" from the last paragraph of that section.) 

In the present setting, each point v of T>t has an "age" at time t, and in 
the limit PWIT these ages are (exactly as in section EO)) independent Exp(l) 
random variables, A v say. Thus if we write £>q for the PWIT and ^4 roo t for 
the age of the root, then (given the other ages A v also) we can reconstruct the 
time-evolution of a backwards space-time PWIT process (V*, — v4 r00 t < s < 0). 
Precisely, as s runs backwards 

(a) the edge-lengths £ decrease exponentially; at time s < the length is £e s ; 

(b) a vertex v and its incident edges are deleted at s = —A v . 

Then T>* is defined as the connected component containing the root at time s. 
This limit process relates to the finite process as follows. Let A t be the age (at 
time t) of the randomly-chosen vertex Vt at time t. Then 

(V$l -A t < s < 0) 4 (D* s , -A I00t < s < 0) as t - oo (20) 

where V t+ ' s is the configuration V t + S rooted at V t . 

There is also a forwards space-time PWIT process (T>*,0 < s < oo) specified 
as follows. Start with the PWIT T>q. At time s increases, all inter- vertex 
distances increase at exponential rate 1. For each vertex v present at time 
s, and each < r < oo, there is (as explained below) chance 1 • dr ds that 
during [s, s + ds] a new vertex v' will appear at distance G [r, r + dr] from v 
as a near neighbor of v. Along with this vertex (which has current age 0) is 
an independent copy of the PWIT rooted at v', whose other vertex-ages are 
independent Exp(l). The relation between the finite-t and the limit process is 
analogous to l|2U|l : 

(D t *{*j, < s < oo) 4 (£>*, < s < oo) as t -> oo. (21) 

Here is the calculation leading to the coefficient "1" in 

there is chance 1 • dr ds that during [s, s 4- ds] a new vertex v' 
will appear at distance G [r,r + dr] from v as a near neighbor of v . (22) 

5 See also for novel scaling exponents arising in the study of the mean-field traveling 
salesman problem. 
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In the process (T> t ), during time [t + s,t + s + ds] about e t+s ds vertices arrive; for 
each existing vertex, the chance an arriving vertex is within D-distance [r, r + dr] 
equals e~( t+s - ) exp(— re~( t+s ^)dr. So the chance in (1221 equals 

e t+s ds x e - {t+s) exp{~re- [t+s) )dr w 1 x dr ds. 

Recursive self-similarity. Implicit in the model is the fact that the "geom- 
etry" of the space seen by a newly-arriving particle v* is statistically the same 
as the geometry seen by a typical existing particle. This is the familiar PASTA 
(Poisson arrivals see time averages) property in queuing theory In particular, 
at the arrival time of v* the geometric components containing the different near 
neighbor vertices t>j.,V2, ... are independent copies of the PWIT. This recur- 
sive self- similarity property of the PWIT process is fundamental to its analytic 
tractability. 

Figure 2 and its legend may be helpful. 




Figure 2. The space-time PWIT process. Regarding Figure 1 as showing 
the PWIT at a time t+, Figure 2 shows the space-time PWIT at an earlier time t- 
at which only three vertices a, b, c of the vertices in the Figure 1 window have arrived. 
Figure 2 shows smaller windows centered on a and on c. The other vertices in Figure 
2, and the near neighbor relation shown by lines, are still present at time t+, but are 
not visible in Figure 1 because the expansion of distances has placed them outside the 
Figure 1 window. 



3.5 The MFSC model 

The process (T>t„ , = 1, 2, 3, . . .) of arrivals and inter-point distances described 
in section 13.31 defines an "underlying geometry" ; we now define the random 
graph process (Q n ,n — 1,2,3, . . .) which is the subject of this paper. Fix two 
parameters < a < oo and < A < oo. Write 

p(x) = min(l, a\e~ Xx ), < x < oo. (23) 
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We now implement a version of the "metric copying" idea from section I3~T1 Q\ 
consists of vertex 1 and no edges. When vertex n arrives at time t n = logn, 
then 

(i) for each directed edge of Q n -l, a "copied" edge (n,j) is created with 
probability p(D(n, i; t n )); 

(ii) for each vertex i (1 < i < n), a new edge (n,i) is created with probability 
p(D(n,i;t n )); 

(iii) the events above are independent, except that repeat edges are censored. 

Note that we use D instead of D in determining attachment probabilities (be- 
cause £>-near vertices at finite time correspond to the near neighbors in the 
limit PWIT). We call (Q n , n = 1, 2,3, . . .) the mean-field simple copying (MFSC) 
model. 




Figure 3. The graph process G*cc(t-) on the realization of the space-time PWIT 
at time t- in Figure 2. For the graph process we show all edges with either end- 
vertex within the window. The following figures show the evolution of Glc(t) over 
t- <t < t+. 
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Figure 4. Figures 4-6 build up the graph Q* oc (t + ) on the time-i+ PWIT in 
Figure 1. Figure 4 here shows only the edges that were present at time t_, that is the 
edges shown in Figure 3. Some edges crossing outside the window have been redrawn 
at different angles for later convenience. 




Figure 5. Suppose in Figure 1 that the center vertex arrives at time t+; Figure 5 
illustrates the graph just before that arrival. Since the time-t- configuration in Figure 
3, more vertices have arrived and formed edges, and distances have expanded. 
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Figure 6. Vertex arrives at time t + with near neighbors d,e,a, f, . . .. Out-edges 
from appear according to the metric copying scheme of section T3. II with p(x) given 
by 1231 . So for each near neighbor v and each existing edge (v,w), an edge (0, v) or 
(0,'m) appears with probability p(D(0,u)). In this realization, edges appeared to the 
near neighbors d and a, and two other edges (one from d and one from e) were copied. 

Our focus in this paper is the study of the MFSC model using its limit structure. 
Just as the t — > oo limit of the time-i "geometry" V t is the PWIT, we can 
consider Q t as a structure built over T> t , and we get a limit random directed 
graph Q* x as a structure built over the PWIT. The structure of Q*^ near the 
root is exactly the t — > oo limit structure of Gt relative to a random ( "typical" ) 
vertex Vt, and so we can obtain (in principle) a wide variety of asymptotic 
results about Qt by doing calculations upon Q*^. The only difficulty in this 
methodology is that we don't have a useful explicit description of Q*^. Instead, 
we can consider the space-time limits <|20I21[) jointly with (Qt); then in the limit 
we get the space-time PWIT processes existing jointly with a random graph 
process (Q* ao (s)), where now 5^,(0) = Q*^. The process (Q* oa (s)) evolves with s 
and the space-time PWIT process by the rules implied by (i)-(iii) above. 
Precisely, the evolution rules (illustrated by Figures 3-6) are 

when a new vertex v* arrives in the forwards space-time PWIT 
process at time s, it has near neighbors (v±,V2, ■ ■ ■) at distances 
(£i>£2> • • •); and Q^o(s) has put a random graph structure on the ge- 
ometric component containing each Uj. For each i and each directed 
edge (v{, w), a new edge (v*, Vi) or (v*, w) is created with probability 
p(D(v* ,Vi)), independently for different possible edges. 
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Recursive self-similarity. What makes this process tractable is that the 
recursive self-similarity property of the PWIT extends to the random graph 
process; each Vi defines a geometric component and a random graph on that 
component, and these are independent copies of the joint distribution of the 
PWIT and Q*^. This property is used extensively in the calculations in section 

El 

3.6 Reparametrization and extreme cases 

Although the MFSC model makes sense for the full range (0 < a, A < oo) of 
parameters, we will only consider the ranges ill 151) for which the limit mean 
degree is finite. 

Note that definition ll'MI) of p(x) can be rewritten as 

p(x) — aXe 

P(x) = { I 

where the reparametrization r\ := A -1 log(aA) when aX > 1 is used in l|25[l. In 
the two extremes of clustering, our model simplifies in different ways. For fixed 
a, when n is large and a is small, the model resembles the following variant of 
the proportional attachment model: 

An arriving vertex has a Poi(a) number of out-edges, whose end- 
vertices are chosen with probabilities proportional to 1+ in-degree. 

At the other extreme, for fixed r\ our model makes sense with A = oo, interpret- 
ing (|2~5)) to mean p(x) — 0, x > r\. In this case the model becomes, for large 
n, 

An arriving vertex v* chooses at random a Poi(ry) number of neigh- 
bors Vj, and creates edges (v*,Vj), and also copies each existing edge 
(vj, w) to a new edge (u*, w). 

Clearly in the former limit we have K c i U ster = and in the latter limit we have 

^cluster 1 • 

4 Calculations 

In section 0] we derive the formulas stated in sections 12.21 - 12.61 As described 
in section 13.51 our methodology is to regard D- ln and -D ut as the (random) 
in-degree and out-degree of the root in Q^, and to study this using the time- 
dynamics of Glo{s) derived from the space-time PWIT process and the evolution 
rules of the graph process. 



-AX 



< x < oo [low] 
< x < rj 



rj < x < oo 



[high] 



(24) 
(25) 
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4.1 Two helpful calculations 

We will make frequent use of the next lemma. 
Lemma 1 For integers u > 1 write 

oo oo 

Z {u) = J2p u &) = [™in(l,a\e-^ 



The 



-EZ^ - I U l0lU> \ 1 [1 ° W] (26) 
/u ~ 1 1 + ^ [high] W 



var 



1 ^,2 



In particular, [3 :— 0\ < 1 /or aZZ parameter values. 

Proof. We will do the low clustering density case - the high density case is 
similar. By @ the chance that some falls into an interval [x, x + da;] is dx, so 

/•oo />oo 

Pu= (a\e- Xx ) u dx = a u X u / e - uXx da; = a"A"/(uA). 
Jo Jo 

Moreover, there is a general formula for variance of a sum over a Poisson (rate 
1) process (£i): 

/>oo 

var (V^u>(£i)) = / w 2 (x) dx 
Jo 

and applying this formula for gives 

poo 

var = / (aXe- Xx ) 2 dx = a 2 \ 2 /{2\). 



We next recall a classical result. Fix < 8 < oo. Set A^(0) = 1 and let 
(N(t),t > 0) be the Yule process of rate 9, that is the Markov process which 
changes only by +1 steps and for which 

P(N(t + dt) = n + l\N(t) =n) = 0n dt. 

A textbook result (e.g. JS| sec - 5.3) says 

N(t) = Geo(e- et ). (28) 

Note that in the PWIT, if N{r) is the number of vertices within distance r 
from the root (counting the root itself), then the process (N(r),r > 0) is a Yule 
process of rate 1, because for a vertex v at distance r' < r, the chance of v 
having a near neighbor at distance G [r — r' , r — r' + dr] equals 1 • dr. So in 
particular, 

EN{r) = EGeo(e- r ) = e r . (29) 
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4.2 Distribution of in-degree 

We start by giving the derivation of 

1 + D- m = Gco(e~ /3T ) where T = Exp(l) © 

for (3 = EZ( X \ In the forwards space-time PWIT process, let N(t) be 1+ the 
in-degree of the root, when the root has age t. Thus N(t) counts the set of 
vertices v for which v — > root is an edge, or v = root. When a new vertex v' 
arrives with some v in this set as a near neighbor, at distance r, there is chance 
p(r) for the root's in-degree to increase by 1, and so from the dynamics (|22|) of 
the forwards space-time PWIT process we see that N(t) is the Yule process of 
rate 

/>OC 

0=1 p(r) dr. 



Use formula (|28[1 and the fact that the age of the root of the PWIT has Exp(l) 
distribution to obtain JSJ|. 

We can quickly use (JSJ) to calculate ED- m . 

l + ED- m = E(E(Geo(e~ 0T )\T)) 

— Ee^ T because EGeo(p) = p^ 1 

e e at 

giving ED[ n = as at (JSJ. We now calculate the distribution of D- m in the 
same way. Because P(Geo(p) > i + 1) = (1 — p)\ i > we have 

P(An>*) = EP(D in >i\T) 

= EP{l + D in >i + l\T) 

= Eil-e-^) 1 



(1 - e-^) l e- 1 dt 

/ (1 — s) l sP ds setting s = e 
Jo 



i r(i + i)r(|) 



— 7— using the Beta integral formula. 

r(i + i + 4) 



This is (HHJl, and (HH follows. 

Historical note. Yule |2U| introduced what we now call the Yule process in 
1924 in the context of a model for evolution of new species. It is interesting that 
his central mathematical results are the Geometric distribution (|28|) [his (5)] and 
the calculation starting from our © [representing, for Yule, a distribution of 
numbers of species in a typical genus] of the explicit distribution l|ll|) [his (12)]. 
After 80 years we have slicker notation but the argument is the same! Moreover 
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Yule's motivation was to find a simple model yielding a power-law distribution 
for number of species per genus, just as the motivation for the recent literature 
on proportional attachment models was to find a simple model yielding power- 
law degree distributions. 

4.3 Distribution of out-degree 

We will first derive l|12f) . Because the out-edges are formed on arrival, we may 
suppose the root of the PWIT has just arrived. Consider a near neighbor v' 
at distance r. For each out-edge of v', and for v' itself, there is chance p(r) 
that a corresponding out-edge is created at the root, giving a total number 
Bin(l + D(v'),p(r)) of out-edges, where D(v') is the out-degree of v'. The 
recursive self-similarity property (end of section l3~5jl implies that the (D(v') : 
v' near neighbor of root) are i.i.d. random variables distributed as -D ut> and 
independent of their distances from the root. Rewriting (D(v')) as (£>W) 
in increasing order of distance from root, 

oo 

D ont = Y / V™(l + D {t) ,p(t t )) (30) 
»=i 

which becomes lJT2]l. 

We now turn to the issue of using (|30l) to get information about the distri- 
bution of D on i. Because a directed edge contributes equally to total in-degree 
and to total out-degree, we know a priori that ED ou ^ must equal ED- m , but let 
us first check that we can indeed use l(T2*|) to show ED on ^ = (3/(1 — (3). Because 
l[T]l EBm(n,p) — np we see 

£Bin(l + = (1 + ED out ) Ep(ii). 

So (EOJ) gives 

ED out = (1 + ED out ) ■ EZV> = (1 + ED out )f3 

giving ED out = 13/(1 - 13). 

Variance. The calculation of the variance var -D ou t provides a textbook 
illustration of the utility of the general conditional variance formula 

var X = Ev&r (X\Y) + var E(X\Y). 

We give the details in the low density case; the high density case is similar. In 
the defining equation (|12fl write D for D ou ^ and write D and 3 for the random 
sequences (Dj) and Because var Bin(n,p) = np(l — p) we have 

var (£>|D,S) = ^(1 + D^aXe'^ (1 - aXe'^). 

i 

Recursive self-similarity, as used above, implies independence of the i.i.d. se- 
quence (Di, i > 1) and the Poisson process (£;, i > 1). So 

.Bvar (Z>|D,S) = (1 + ED)(EZ {1) - EZ {2) ) 
a (l - sA) 

= — 2— using © and ©■ (31) 
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Next consider the conditional expectation 

E(D\D, 3) = ^(1 + D^aXe-^' = W, say. 

i 

We will calculate var W by using the conditional variance formula. Because 
var (W\E) = ^](var D) • a 2 A 2 e" 2A ^ 

i 

we have 

£var (W\E) = (var D) ■ EZ {2) = (var D) ■ a 2 X/2. (32) 
And since E(W\E) = (1 + ED)Z^ = j4^^ (1) we have 

var £(W|H) = 1 var Z« - — — (33) 
(1 — a)- 2 2(1 — a) z 

Using the conditional variance formula twice 

var D = Evai (D|D, 3) + var W 

= £var (D|D, 3) + £var (W|S) + var £(W|3) 

= ^^ + (varD).^A/2 + -^. 
1 — a z(l — a) z 

Solving gives the equation i|13|) for var D ou t- 

Special cases, (a). Fix a. Because p(x) < aA, in the A — > limit we 
can apply the Poisson limit of Binomials result to the defining equation 1)12(1 to 
obtain (cf. section l3~?)f 

(in A -> limit) D out = Poi(a). (34) 



(b) . Fix rj. In the A — ► oo limit we can use the limit process of section 
to show that 1 + -D ut nas the distribution of the total population size in a 
Galton- Watson branching process 6 with Poi(^) offspring distribution. This is 
(see e.g. |S]) the Borel-Tannerfjj) distribution 

(nd) d ~ 1 e~ r > d 

(in A -> oo limit) P(l + D out = d) = - ' „ , d > 1. (35) 

a! 

(c) . In the case aA = 1 it turns out (an argument is sketched in section T5. 41 

1 + Amt = Geo(l-a). (36) 

Independence of in- degree and out-degree. This independence, noted at (|14|) . 
follows from the fact that in the forwards space-time PWIT process the out- 
degree of the root is determined at the arrival time of the root vertex; the 
subsequent evolution of the process of in-edges is clearly independent of the 
state of the graph immediately after arrival. 

6 A population process starting with one individual in generation 0, individuals having i.i.d. 
random numbers of offspring in successive generations 
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4.4 Densities of induced subgraphs 

Here we give details of the definition and interpretation of "density of induced 
subgraphs" mentioned in section |2~H and list explicit formulas. 

Let G and Q be finite directed acyclic graphs; think of G as small and Q as 
large. Define "density of G as an induced subgraph of Q" by 

j n~,\n\ #i v c S ■■ V isomorphic to G} 

deas(G\G) = — — ^ 

#{ vertices ol Q\ 

where # denotes cardinality ( "number of ) and V denotes a vertex-subset of Q 
with its induced subgraph. See Figure 7, where there are 3 such vertex-subsets 
{a, 6, e}, {b, c, e}, {c, d, e} and so where dens(G\G) — 3/5. 

. J . G l . . G 2 , root 



• • root • 

b g d 

Figure 7. Two rootings Gl , G%, of a graph G. 

We want to study n — ► oo limits of dens(G|0 n ) in our MFSC random graph 
model (£/ n ), for fixed G. To use our methodology we must first rephrase the 
definition of dens(G|C?) in terms of the rooted graph Q* obtained by giving Q a 
uniform random root. For such Q* , and for an arbitrarily-rooted directed graph 
G* , define a random variable 

X(G*|(?*) = number of vertex-subsets V of Q* including the root 
such that Q* restricted to V is isomorphic to G* via a root-preserving 
isomorphism. 

The randomness arises only from choice of root of G*\ notation X has no special 
significance except to distinguish this from simpler random variables. 
It is an easy fact that 

dens(G|C ? ) = (37) 

where G* is G with an arbitrary choice of root, and where t(G*) > 1 is the 
number of different root-choices which would give a rooted graph isomorphic to 
this particular choice. Rather than write a formal proof, let us just illustrate 
identity (|37l) using Figure 7. For the choice of root giving G*, the number of 
isomorphic vertex-subsets V of G* equals 2 (resp. I) if the root of G* chances 
to be b (resp. d), and so EX.(G\\Q*) = 3/5. For the choice of root giving Gj, 
the number of isomorphic vertex-subsets V of G* equals 1 (resp. 2, 3) if the root 
of G* chances to be a (resp. c, e), and so E^G^G*) = 6/5. Since t(G*) = 1 
while t(G|) = 2, we have checked identity l|37|) in this example. 

Note that in Figure 7, the induced subgraph on {a, 6, c} is not isomorphic to 
G because of the extra edge a — > c. Obviously we can make parallel definitions 
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allowing extra edges (pedantically: replace "isomorphism" by "vertex-bijection 
and edge-surjection") and we write dens (G\G) and X(G*|C/*) in this setting. 
For instance, in Figure 7 we have dens (G\Q) = 4/5 and EX(Gl\G*) = 4/5. 

The point of all this is that the definition of X.(G*\Q*) makes sense when Q* 
is a rooted infinite graph. The key methodology in our analysis of the MFSC 
model (G n ) is that the randomly-rooted Q* n converge locally to a limit random 
infinite rooted graph Q*^, implying via l|37[) that 



In parallel. 



dens(G|g„) -> J?X( ^ l f°° ) := den Soo (G), say. (38) 

L{U ) 



dens (G\G n ) -» ^ X( ,^ |g ° o) := d"en Soo (G), say. (39) 

L((j ) 



In sections 14.51 and 15.21 we calculate dens(G) for several cases of G; let us 
record the formulas below. Note that limit densities may be infinite, in which 
cases we will point out the conditions on parameters needed for finiteness. 

(a) . For a directed path ir r with r > 1 edges, 

clensoo (7i>) = d r . (40) 

(b) . For the complete directed acyclic graph K r on r > 2 vertices, (that is, 
vertices {1,2,..., r} and edges i — > j for 1 < i < j < r), 

T— 1 

den Soo (K r ) = J] -A- (41) 

. t Pit 
u— l 

where p\ = /? and for general u > 1 

In particular, for the case of triangles K$ we have explicitly 



den Soo (K 3 ) = i ggggg ; ' (43) 

As already mentioned in section ^. 41 the formula above is the key ingredient in 
the formula for Cluster- Recall its verbal description 

The proportion of directed 2-paths v% — > v 2 — ► v 3 for which vi — > w 3 
is also an edge. 

This becomes 

dens 00 (if 3 ) 

^cluster = "= (44) 

densoo (7r 2 ) 



[low] 
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and then 1)4014411 immediately give the formula Q). 

(c) . For a directed path ir r with r > 1 edges, 

"—"^K d-Axf-A) )"'- (45) 

(d) . For the complete bipartite directed graph K 2 .2, for P2 < 5 (which always 
holds in the low density case) 

denS °° (K ^ 2) = (1-200(1- ft) ' (46) 

(e) . In principle one can calculate denSoo(G*) for any G*, but in practice it 
is not clear to what extent useful explicit formulas can be found - see section 
14.51 for further discussion, and for the observation that certain graphs G* have 
denSoo(G*) = 0, "asymptotically negligible density". 

4.5 Densities of induced subgraphs: easy explicit formulas 

Recall the basic result 1)3811 on subgraph density: 

dens(G|g„) -f SgffiJ := den Soo (G) 
i(G J 

where G* is an arbitrary rooting of G. In calculating the right side, to simplify 
notation we write 

x {G*)=EK{G*\gi ) 

and similarly for x(-) and X(-) and dens (G). 

First consider 7r r , the directed path with r edges, rooted at the last-arriving 
vertex, which we will call the head. Clearly x{tti) = ED on i = d. Let us write 
out the (rather obvious) inductive argument for calculating xi^r)- Whether or 
not the root vertex of Q*^ is the head of a r-path is determined at its arrival time. 
Consider a near neighbor Vi of the root, at distance £j. The expected number 
of r — 1-paths headed by Vi equals x(7r r _i). So the expected number of r- 
paths of the form root equals xi^r-i) x P((root, vi) is edge of Q*^). 

Summing over i gives 

XM = X(n r -i) x ED ont = x(TTr-i) x d 

and so xi^r) — d r by induction. This is formula l|4(J|) . 

The result for the complete directed graph K r on r vertices is similar. For 
r = 2 we have x{Kz) = x( 7r i) = 9 and so to establish formula (|4*T|l by induction 
it is enough to show 

x(K r+1 ) = X (K r )x Y ^-. (47) 

In the forwards space-time PWIT process, consider a vertex-set S r isomorphic 
to K r , headed by its latest-arriving vertex v*. At time t after the arrival of w*, 



24 



let N t — 1+ the number of if r+ i-subgraphs of the forwards space-time PWIT 
process which are of the form {v} U S r for some v; regard the as counting 

S r itself. Then N t is a Yule process of rate 

££>&)=& (48) 

i 

because for each vertex v counted in Nt, a new vertex v' arriving with near 
neighbor v at distance x has chance [p(a;)]' to create the r edges needed to 
make {v'} U S r be a K r +\ subgraph. Moreover these are the only ways in 
which a new K r+ i of the form {v 1 } U S r can be formed. By the Yule formula 

N t = Geo(exp(-i/? r )). Now regard K r +\ as rooted by its second-latest 

arriving vertex. In Q*^ the root has age T = Exp(l). At its arrival time the 
root headed some random number of K r 's, with mean x(K r ), so by considering 
the mean number of K r+ is at the present time 

x(K r+ i) = x(K r ) x (EN T - 1) 

= x(K r ) x (EeMTpr) - I) 

= *w x (r^ ~ 

giving igjjl. 

Other subgraphs. The derivations of formulas 145I46|1 dealing with paths and 
^2,2 are relegated to sections 15 . II and 15 1 21 

For the graph out — star r consisting of r out-edges at a root, it is clear that 

x(out — star r ) = E 

and similarly 

x(in — star r ) = E 

and these can in principle be evaluated using I|12l8fl . 
Some subgraphs have density zero. 




It is easy to see that the graph G* above (where no "vertical" edge is present) 
has x(G*) = 0. 





4.6 Reparametrization 

Writing a, A in terms of d, K — K c i ust0 r by solving I|6I7(I gives the formulas 

1 : 4k 1 0<k ^ (49) 
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A 



d((d+2)K-l) 

(3+1)(1+0k) 
(3+1)(1+3k) 



I 



d + 2 



1 



<«;<!. 



(50) 



29(1 -k) 



4.7 Edge-lengths 

The previous calculations have not made very extensive use of the time-dynamics 
of the forwards space-time PWIT process, and in particular have not used the 
fact that edge-lengths grow exponentially at rate 1. To derive the formula H17[l 
for edge length density we do need to exploit such time-dynamics. We consider 
only the low-density case; the high density case is more complicated because 
the distribution in (ii) below is no longer exponential. 

Consider the lengths of the in-edges at a particular vertex vq. Following a 
tradition in mathematical probability, we visualize an in-edge of length £ as a 
"particle" at position I on a line; we also put a particle at position to represent 
the vertex vq itself. If we start time r with r = at the arrival time of vq, then 
the evolution of the "particle process" can be specified as follows. 

(i) There is a particle at position at all times r > 0. 

(ii) For each particle (at position x at time r, say), at stochastic rate a per unit 
time a new particle appears at position x + Exp(A). 

(iii) particle positions increase deterministically at exponential(l) rate: a parti- 
cle at x at time r will be at xe T °~ T at time tq > r. 

Rule (ii) derives from l|22l) : for an existing edge (v',vq), a new vertex arriving at 
distance r from near neighbor v' creates an edge to vq with probability p(r) , so 
the rate at which each existing edge is copied equals J Q p(x) dx — a; moreover 
conditional on copying, the distance r has Exp(A) distribution, and so the length 
of the new edge equals the length of the old edge +Exp(A). 
To analyze this particle process of edge lengths, define 



We shall study 

f{x) dx = £?(number of in-edges at a typical vertex with length G [x, x + dx]). 
Because the age of a typical vertex has Exp(l) distribution, f(x) can be written 



G(r, x) 



E (number of edges of length > x at time r) 



so that 



9{t, x) 



— G(t, x) — mean edge-length density at time r. 



as 




The verbal description of the particle process leads to the equation 




2G 



Here the first term on the right expresses the deterministic exponential growth, 
the second term expresses birth of particles to parents not at (copying of 
existing edges) and the third expresses births to the O-particle (new edge to Vq). 
Multiply the terms of the equation by e _T and integrate out r; noting 

/ —G(T,x)e~ T dT= / G(t, x)e~ T dr = F(x), say, 
Jo dT Jo 

we obtain 

F(x) = xf[x) + a [ f{y)e-^ x -^ dy + ae~ Xx . (51) 



Differentiate: -/ = (xf)' + af - Xa f* f(y)e~ x( - x - v ^ dy - Xae~ Xx . 

Rewrite with the integral term on the left, and then substitute the integral term 

by the expression implied in 151JI : 

X(-xf - ae' Xx +F) = f+ (xf)' + af - Xae~ Xx . 

Differentiate: X(-(xf)' + Xae- Xx -/)=/'+ (xf)" + af + X 2 ae~ Xx . 
Tidy: (xf)" + X(xf)' + (1 + a)f + Xf = 0. 

Look for a series solution f(x) = J2n°=o a nX n ■ Equating coefficients of x n : 

(n + 2)(n + l)a n+1 + X(n + l)a n + (1 + a)(n + l)a n +\ + Xa n = 0. 
That is, 

a-n+i -X(n + 2) 



a n (n + l)(n + 3 + a) 

and so 

_ (-X) n (n + l)T(3 + a) 

a n ^7 — —5— — ^ a . 

1 (n + S + a) 

One can directly check that /(0+) = 1, identifying ao = 1. Because the mean 
in-degree is a/(l — a), the probability density function of a typical edge-length 
must be i^/(x), establishing fTTjl . 

5 Further calculations 

5.1 Yule arguments for subgraph density 

The next Lemma abstracts the Yule process arguments used in section 14.51 
Recall the reformulation there of limit subgraph density in terms of x{G) and 
X(G). 
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Figure 8. Illustration of Lemma 

Lemma 2 Let Go be a rooted directed acyclic graph such that each vertex is a 
descendant of the root. Let q be the out-degree of the root. Let G\ be a directed 
acyclic graph obtained from Go by adding an extra vertex w and edges (w, root) 
and c further edges from w to some children of the root (so < c < q). Then 

x(Gi)/x(Go) = T^— (52) 

1 - Pc+l 

x(Gi)/x(Go) = g(-i)j(*7 c ) (53) 

^ \ J /I- Pc+l+j 

(54) 



In particular, if q = c then 

X (G 1 )/x(Go) = T^—- (55) 

J- - Pc+l 

As a quick application let us derive formula 145fl for x(7r r ) for the directed path 
7r r on r edges. Applying l|53|) with q = 1, c — 0, 

/ \ / / \ ^1 01 — ^2 

X(7Tr+l)/x( 7r r) = 



l-/3i l-/?2 (l-A)(l-j8a) 

Because x( 7r i) = 9 and t(7r r ) = 1 we obtain formula Ij45(l . 

Proof of Lemma\^ We will do the harder case (|53|l . Consider a copy of 
Go (he. an isomorphic subgraph) at the root of the PWIT. In the space-time 
PWIT process, let M (t) be the number of copies of G\ which contain the given 
copy of Go, at time t after the arrival of the root. Since the age T of the root 
has Exp(l) distribution, 

x(G 1 )/x(G )=EM(T). 
Write S for the set of children of the root in Go and write A for a subset of S. 
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Consider the process of arriving vertices v which form an edge to the root. 
Such a v has a near neighbor v' , where either v' = root or (V,root) is already 
an edge. Writing A(v') C S for the set of children of the root to which v' creates 
an edge, then A(v) C A(v'). We can now write 

M(t) = #{w : (v,root) is an edge, A(v) = A±} 

where A\ is the set of children of the root of G\ to which w has an edge. 
In representing M(t) as above, we are using the hypothesis "each vertex is a 
descendant of the root" to ensure that, in a subgraph of the space-time PWIT 
isomorphic to G\, the last-arriving vertex must be w. 

Consider a sequence root = Vm, um, . . . , vus of arriving vertices such that 
each vertex vr/,,) arrives at distance x k from its near neighbor vr^-i)- The 
chance that each vua makes an edge to the root and to each child in A% 
equals Ilfc=i P c+1 ( x k)- The chance that furthermore no other child in S ac- 
quires an edge to equals (1 — Y[k=iP( x k)) qc ■ By considering the times 
< t\ < t 2 < ■ ■ ■ < ti < t of arrivals of vu> , 

EM(t) = J2 / dh.-.dti ... dx 1 ...dx l p c+1 {x)(l-p(x)) q - 

i=l J0<tx<...< Jti<t JO JO 

where p(x) = lY k=1 p(^)- Because p c+1 (x)(l-p(x))«- c = E^^^CDp^" 
and J °° p c+1 +i ( Xk ) dx k = Pc+i+j, 



oo j q-c , 



i=i 3=0 

q-c 



J c+l+j 



= EC" 1 )' ( 7 )(exp(/3«H-i +i t)-l). 
j=o \ J / 

Calculating EM (T) = e- l EM{t) dt establishes Q . 
5.2 Subgraph density of K 2 .2 

We have not pursued general methods for induced subgraph density beyond 
Lemma |21 but the argument that follows for the particular case of ^2,2, based 
on splitting into two cases, could clearly be applied somewhat more widely. 
We first quote 



Lemma 3 



^( Ge ° 2 (?,) ) = p- 2 -p-> (56) 
^( GC0( ^ 1 ) = ,- 2 -2 p - + 1. (57) 
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Next consider the graph G* on the left of Figure 9. 



G* . 




,V-l 




G 



root 



,V- 2 



Figure 9. Graphs related to K2,: 



We will show 



X(G*) = 



2(%d 



(1-200(1-00 



(58) 



where /3 2 = ^^ (2) . 

We start by repeating the argument in the r = 2 case of (|47|) . In the 
forwards space-time PWIT process, consider the newly-arrived root and an edge 
(root, u-i). At time t after the arrival of the root, let N t — 1+ the number of 
vertices v such that (v, root) and (v,V-i ) are both edges of the graph process; 
regard the as counting the root itself. Then Nt is a Yule process of rate 

P2 = / °° P 2 {x) dx. Thus at time t there are ( y ) graphs of the desired form 
containing the edge (root,u_i). Because the age T of the root has Exp(l) 
distribution, we see 

'N T - V 
2 



X(G*) = dE 



where d — ED ou t is the expected number of edges of the form (root, U-i). 
Using fify and lfS7|) . 



X(G* 



8E (e 2&T 
d(—± 



2e 



l-/3 2 



/3 2 T 



1 



leading to (JSSJ. 

Next, in consider 

Q := number of unordered pairs (v—i,V—2) such that (root,u_i) and 
(root, i>_2) are edges, and v-\ and u-2 were in different geometric 
components at the arrival time of the root. 

By considering distances r\ , r2 from the root to the near neighbors of the geo- 
metric components containing Vi,V2, 



EQ 



p(ri)p(r 2 ) drxdr 2 x9 2 = (3 2 d 2 /2. 
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Now consider in Q* x configurations G as on the right of Figure 9, where there 
is no edge between v-\ and V— 2 , and where the root is the first-arriving vertex 
to have edges to both V-% and V- 2 ] these requirements are equivalent to saying 
that at the arrival time of the root, u_i and V-2 were in different geometric 
components. Reuse a now-familiar argument. At time t after the arrival of the 
root, let N t = 1+ the number of vertices v ^ root such that (v, v^ 2 ) and [v, V-%) 
are both edges of the graph process; regard the as counting the root itself. 

Then N t is a Yule process of rate @2 = / °° P 2 (x) dx. Thus at time t the number 
of possible unordered pairs {v\,V2\ which give the configuration in the figure, 
where we allow one of {vi,i>2} to be the root, equals (^*). Because the age T of 
the root has Exp(l) distribution, we see 

x(G)=E^j xEQ 

where x(G) is the density of graphs as on the right of Figure 9, perhaps with 
extra edges, but subject to the requirement that the root is the first-arriving 
vertex to have edges to both V—% and v-2- Using (|28[) and (|57|) . 

n t\ ^/2S,T „s,t\ 1 1 



(l-2/3 2 )(l-/3 2 )' 
One can now write 

clens (K 2>2 ) - \x{K 2>2 ) = x(G*) + X (G) 

because a 4-vertex graph in Q*^ containing K2.2 is either of the form G* or is 
the restriction of a graph of the form G, in which the extra root is specified 
by the requirement stated above (the factor 1/2 reflects the fact (.(-^2,2) = 2). 
Combining the formulas above gives 

W*M) - (1 _^. ft) + ^ >< d-^d-ft) 
which simplifies to (|46|l . 



5.3 Directed percolation 

Here we record some calculations without detailed explanation. In the context 
of the space-time PWIT and the evolving random graph process <7^o(s), we can 
seek to grow a "core" graph C(s) inside £?^(s) via a greedy rule: 

a newly-arriving vertex is included in C(s) if it creates an edge to 
some vertex already in C(s), in which case all such edges are included 
in C(s). 

If this construction works, we expect the process (C(s)) to have a a stationary 
distribution C(0), say, where C(0) C Q* x . Consider 

q = P(root e C(0)) 

y = out-degree of root in C(0), given root 6 C(0). 
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Consider the relation 

oo 
i=l 

where we write Ber(p) for a Bernoulli(p) r.v. (taking value 1 with probability p 
and value otherwise) . Using the recursive structure of the limit random graph 
process, we see that q and Y solve the equations (for unknown < q < 1 and 
an unknown distribution Y on {1, 2, 3, . . .}) 

Y = dist(y|y > 1); q = P(Y>l). (59) 

Define Pdir- P erc( a ' A) to be the solution q if it exists, and to be otherwise. 
The interpretation of this quantity in terms of the finite random graph process 
(G n ,n > 1) is that 

n~ l ET -y Pdir-perc( a > A ) 
where T is the maximal size of a tree in Q n directed toward some root. So in 
particular, for p pe rc(a, A) defined at (JTSJl, 

Pdir-perc( a > A ) <Pperc(«,A). 

Equation l|59|) in principle determines Pdir-perc( a ' A), but to get an explicit 
bound we reuse an underlying idea. Because 1 + Yi > 2, 

oo 

Y > 5^Ber < (?)Bin 4 (2,i>(e i )) - Y* , say. 

i=l 

If the equation 

q = P{Y* > 1) (60) 

has a solution q > then one can argue Pdir-perc( a ' A) > q. But i|6Ufl is an 
explicit equation 



1 - q = cxp ( - y (2p(a;) - p 2 (a:))q dxj = exp(-(2/3 - /3 2 )g). 
If 2/3 — /?2 > 1 there is a solution g > 0, establishing l|19fl . 
5.4 Out-degree in the case aX — 1 

The special property of this case is that p(x) = e~ Xx . On the PWIT consider 

Y= Ber(e- Ad ^ root )). 
unroot 

This satisfies the same recursion (in the special case) as does £> ou t- But there 
is another way to study Y, which we sketch briefly. Either the root of the 
PWIT has no children within a small distance S; or it does have a child, and the 
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distances to the other descendants of the root and of this child are independent 
copies of the PWIT distances. Because the effect on Y of increasing distances 
by S is to censor each Bernoulli success with probability A<5, we see that Y 
is the stationary distribution of the continuous-time Markov chain on states 
{0, 1,2,.. .} with dynamics 

y — > y — 1 : rate Xy 
y — > y + Y + 1 : rate 1 

where Y is an independent copy of Y. One can now check algebraically that 

P(X = v) = (i-ftti) v , y>o 

solves the balance equations for this chain. That is, 1 4- D ou t has Geo(l — 4) = 
Geo(l — a) distribution, as asserted in H36|) . 

Remark. Antar Bandyopadhyay (personal communication) has given a purely 
analytic verification of (|36|1 . 

5.5 Triangle density of a function of degree 

Here we outline an argument for l|l(j|) ■ Because D- m has power-law tail and -D ou t 
has geometric tail, when D — D- m + -D ut is large, say k, then -D ou t = 0(1) and 
D} n = k — 0(1). It is then not hard to argue that the large-fc behavior for C(k) 
will be the same as for 

. . Mnumbcr of triangles with in- vertex vq I vq has in-degree k) 

Clk, = (5 

where the in-vertex of a triangle is the vertex with two in-edges. 
Recall from section that 

N(t) = number of in-edges at a typical vertex vq at time t after its 
arrival 

is the Yule process of rate (3. Write vi, V2, V3, . . . for the successive arriving 
vertices which create edges to vq, and for i > 2 write 

Mi = number of edges from Vi to {i>j_i, Uj— 2, ■ ■ ■ , v\}. 

After Vk arrives there are M2 + A/3 + . . . + Mk triangles with in-vertex vq . If we 
can show 

EMk — > b as k — > 00 

then we will have 

(«) 

Here we are sliding over the fact that 1 + Dj n is the Yule process evaluated at an 
independent Exp(l) time T; conditioning this to take a value k does not affect 
the properties used in the argument below. 

Suppose vertices v%, . . . , Vk-i have arrived and consider what edges will be 
created when Vk arrives. The dynamics (|22J) of the space-time PWIT say 
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the rate of arrival of new vertices with some one of vq, V\, . . . , Vk-i as 
near neighbor and at distance 6 [x, X + dx] from that near neighbor 
equals k dx. The index I of that near neighbor vj is uniform on 
{0,1,...,A;-1}. 

Such an arriving vertex creates an edge to vq with probability p(x). So condi- 
tional on that event (meaning the arriving vertex is Vk), the distance £ from the 
near neighbor vj and the index I = 1^ oi that near neighbor satisfy 

(i) i has probability density function p(x) = r p ^ x ] — ; 

/ p(u)du 

(ii) / is uniform on {0, 1, . . . , k — 1}. 

Because Vk will copy each of the Mi out-edges from Uj with probability 
each, and create an edge to Vi with the same probability, we obtain the recursion 

M k = Bin(l + M/,p(£)) 

where Mi = and where we interpret the right side as when I — 0. So the 
limit lim fc EM k = b solves b = (1 + b)Ep(£) and so b = ■ Finally, 

Ep{£) - / p{x)p(x) dx = 



Sob= -fy- and JSJl follows from (EB- 



6 Comparison with other models 

Recent complex networks models fall into two categories. In the small worlds 
models popularized by Strogatz and Watts, vertices are points in d-dimensional 
space, which automatically provides a metric distance between vertices, and 
the model uses some rule to create a random graph with short-range and long- 
range edges. In purely graph-theoretical models, such as the basic proportional 
attachment model popularized by Albert and Barabasi 7 the vertices have no 
"intrinsic structure" other than that provided by the graph; we visualize this as 
saying that each pair of vertices is metric distance 1 apart. In a metric copying 
model we visualize vertices as points in some abstract metric space, representing 
(in the case of web pages, say) the difference between the content of the pages, 
or (for people) some notion of "social distance" based on location, education, 
profession, interests etc of the individuals. In detail the mean-field model of 
distance model is used for mathematical tractability rather than any claimed 
realism. But the exponential growth of number of vertices with metric distance 
is intermediate between, and surely in many contexts more plausible than, the 
alternatives implicit in the two standard categories of model above. 

Within graph-theoretic models, the idea of distance preferences in attach- 
ment has been explored (see ^2] and citations therein). But the general idea 

7 but really just a minor variation of Yule's idea: see Lecture 4 of 121 



34 



of combining proportional attachment with metric geometry has scarcely been 
explored 8 , and the specific use of the mean-field model is novel. 

As a technical note, the mean-field model is a zero-parameter 9 model of 
distance. Our full network model has the two parameters (a, A); in contrast a 
typical small-worlds network model has four parameters (dimension, number of 
short-range links, constant and exponent for probability of long-range edges). 

As another technical note, the property (cf. I|16|l) C(k) ~ c/k has been 
proposed ^7] as a criterion for identifying networks which are "hierarchical" 
in some sense. But in our finitc-rt model (recall section I3.5|) each vertex has 
qualitatively the same behavior, rather than different vertices being a priori 
assigned different hierarchical roles. So our model is non- hierarchical, and we 
are inclined to regard the criterion as ineffective 10 . 

The specific model studied in this paper is intended as a "general purpose" 
model rather than being tuned to some particular subclass of real-world net- 
works. Having as one ingredient the now-familiar proportional attachment fea- 
ture, one could look at the many existing variant models in the literature and 
explore them within our platform. In other words, there are many ways to add a 
third parameter intended to express some presumed real-world feature or some 
theoretical desideratum. For instance 

• One can impose connectivity by requiring that a new vertex always links 
to its nearest neighbor. 

• one can add rules allowing a new vertex to immediately acquire in-edges, 
or for edges to randomly appear between existing edges. Such rules can 
be designed (as in e.g. section 11) to produce power law distributions 
for in-degree. 

6.1 Concluding remarks 

In this paper we have focused on 

• describing the model and its conceptual background (section 

• listing explicit formulas fsections !2 . 21 - 1^1))) and exhibiting the calculations 
which lead to these formulas (sections 01 and [SJ. 

We are postponing to a later paper consideration of 

• technical issues in the relation between the finite-n model and its infinite 
limit g*^ 

• the open problems indicated in sections 12.71 - 12.81 whose study requires 
the "bounding" techniques of theoretical mathematical probability rather 
than explicit calculations. 

s |14| gives a simulation study of an explicitly power-law model, as well as interesting 
empirical study of a notion of lexical distance between web pages 
9 Zero dimensionless parameters, to be pedantic 

10 One could alternatively regard it as indicating some subtle emergent hierarchical structure; 

cf. 



35 



Acknowledgement I thank an anonymous referee for detailed constructive 
comments. 



References 

[1] R. Albert and A.-L. Barabasi. Statistical mechanics of complex networks. 
Rev. Mod. Phys., 74:47-97, 2002. 

David Aldous. Notes on STAT 206. 

http:// w. stat.berkeley.edu/users/aldous/Networks/index. html 2003. 



David Aldous and Allon G. Percus. Scaling and universality in continuous 
length combinatorial optimization. Proc. Natl. Acad. Sci. USA, 100:11211- 
11215, 2003. 

D.J. Aldous. A stochastic complex network model. Electron. Res. Announc. 
Amer. Math. Sac, 9:152-161, 2003. 

D.J. Aldous and J. Pitman. Tree-valued Markov chains derived from 
Galton- Watson processes. Ann. Inst. H. Poincare Probab. Statist., 34:637- 
686, 1998. 

D.J. Aldous and J.M. Steele. The objective method: Probabilistic com- 
binatorial optimization and local weak convergence. In H. Kesten, editor, 
Probability on Discrete Structures, volume 110 of Encyclopaedia of Mathe- 
matical Sciences, pages 1-72. Springer- Ver lag, 2003. 

B. Bollobas and O. Riordan. Mathematical results on scale- free random 
graphs. In S. Bornholdt and H.G. Schuster, editors, Handbook of Graphs 
and Networks, pages 1-34. Wiley, 2002. 

B. Bollobas and O. Riordan. The diameter of a scale-free random graph. 
To appear in Combinatorica, 2004. 

P.G. Buckley and D. Osthus. Popularity based random graph models lead- 
ing to a scale-free degree sequence. Discrete Mathematics, to appear, 2004. 

S.N. Dorogovtsev and J.F.F. Mendes. Evolution of networks. Adv. Phys., 
51:1079-1187, 2002. 

S.N. Dorogovtsev, J.F.F. Mendes, and A.N. Samukhin. Structure of grow- 
ing networks with preferential linking. Phys. Rev. Lett., 85:4633-4636, 
2000. 

J. Jost and M.P. Joy. Evolving networks with distance preferences. Physical 
Review E, 66:036126, 2002. 

P.L. Krapivsky, G.J. Rodgers, and S. Redner. Degree distribution of grow- 
ing networks. Phys. Rev. Lett, 86:5401-5404, 2001. 



36 



[14] F. Menczer. Growing and navigating the small world web by local content. 
Proc. Natl. Acad. Sci. USA, 99:14014-14019, 2002. 



[15] M.E.J. Newman. The structure and function of complex networks. SIAM 
Review, 45:167-256, 2003. 

[16] M. Penrose. Random Geometric Graphs. Oxford Univ. Press, 2003. 

[17] E. Ravasz and A.-L. Barabasi. Hierarchical organization in complex net- 
works. Physical Review E, 67:026112, 2003. 

[18] S. Ross. Stochastic Processes. Wiley, 1983. 

[19] H. Tangmunarunkit, R. Govindan, S. Jamin, S. Shenker, and W. Willinger. 
Network topology generators: Degree-based vs. structural. In SIGCOMM 
02. ACM, 2002. http://topology.eecs.umich.edu/archive/sigcommo2.! 



[20] G.U. Yule. A mathematical theory of evolution, based on the conclusions of 
Dr J. C. Willis. Philos. Trans. Roy. Soc. London Ser. B, 213:21-87, 1924. 



37 



